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Abstract 

Experiments built to search for neutrinoless double beta-decay are limited in their 
sensitivity not only by the exposure but also by the amount of background encoun- 
tered. Radioactive isotopes in the surrounding of the detectors which emit gamma- 
radiation are expected to be a significant source of background in the GERmanium 
Detector Array, GERDA. 

Methods to select electron induced events and discriminate against photon in- 
duced events inside a germanium detector are presented in this paper. The methods 
are based on the analysis of the time structure of the detector response. Data were 
taken with a segmented GERDA prototype detector. It is shown that the analysis 
of the time response of the detector can be used to distinguish multiply scattered 
photons from electrons. 
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1 Introduction 



Radioactive decays in which photons with energies above Q = 2039 keV are 
emitted are expected to be a significant source of background for the GER- 
manium Detector Array, GERDA [lj. GERDA is an experiment which is 
currently being constructed and has as aim the search for the neutrinoless 
double beta-decay (0v/3(3) of the germanium isotope 76 Ge. 
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Methods to distinguish between electrons and multiply scattered photons us- 
ing the time structure of the germanium detector response, or pulse shape, are 
presented in this paper. The pulse shape depends on the location and the spa- 
tial distribution over which energy is deposited inside the detector in a single 
event. Photons in the MeV-energy region will predominantly Compton-scatter 
and deposit energy at locations separated by centimeters. These events are re- 
ferred to as multi-site events. In contrast, electrons in the same energy region 
have a range of the order of a millimeter. Events of this kind are referred to 
as single-site events. 

Pulse shape analysis methods have been developed for nuclear experiments 
such as AGATA [2] and GRETA [3] as well as for double beta-decay ex- 
periments [4.5.6.8.9J. In the context of the latter these techniques are now 
extended to segmented detectors. In this study the focus is on the pulse shape 
analysis after the application of a single segment requirement as presented 
in [TU]. The performance of the pulse shape analysis with and without seg- 
ment information is compared based on data taken with an 18-fold segmented 
GERDA prototype detector. 

The experimental setup and the collected data sets are described in Section [2j 
The accompanying Monte Carlo simulation is introduced in Section [31 A pa- 
rameter accessible in simulations which is a measure of the volume over which 
energy is deposited inside the detector is defined. A definition of single-site 
and multi-site events is derived from the Monte Carlo data sets based on this 
parameter. The fraction of single-site and multi-site events in the data sets 
is estimated. Three analysis methods are presented in Section H] and these 
methods are applied to the data sets taken with the prototype detector. The 
results are summarized in Section [5j Conclusions are drawn in Section [61 



2 Experimental setup and data sets 

2.1 Experimental setup and data taking 

The segmented germanium detector under study is the first segmented GERDA 
prototype detector. The true coaxial cylindrical crystal has a height of 70 mm, 
an outer diameter of 70 mm and a central bore with a diameter of 10 mm. It 
is 18-fold segmented with a 6-fold segmentation in the azimuthal angle <fi and 
a 3-fold segmentation in the height z. It was operated in a conventional test 
cryostat. Signals from the core and the segment electrodes were amplified and 
subsequently digitized using a 14-bit ADC with a sampling rate of 75 MHz. 
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The energy and the pulse shapes of the core and the 18 segment electrodes 
were recorded for each event. The pulse shape data consists of 300 13.3 ns 
samples of the integrated charge amplitude. The onset of the signal was de- 
layed by one /xs. The (full width at half maximum) energy resolution of the 
core electrode was 2.6 keV at energies around 1.3 MeV, the energy resolutions 
of the segment electrodes ranged from 2.4 keV to 4.8 keV with an average 
segment energy resolution of 3.3 keV. Details of the experimental setup and 
the detector performance can be found in [TT] . 



A 100 kBq 228 Th source was placed at z = cm and r = 17.5 cm with respect 
to the detector center (z — cm, r = cm) facing towards the center of a 
segment, S, located in the middle row. Two data sets were taken with dif- 
ferent trigger conditions labeled TRc and TR$. The former trigger condition 
requires the core electrode to show an energy above 1 MeV. The collected 
data set is referred to as core data set and contains 127000 events. The latter 
trigger condition requires segment S to show an energy above 1 MeV. The col- 
lected data set is referred to as segment data set and contains 420 000 events. 
As an example, Figure [1] shows a pulse shape measured with the core (left) 
and with the segment S electrode (right) for an event in the segment data set. 
The core-energy spectra will be shown in Section 15.31 
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Fig. 1. Pulse shape measured with the core (left) and with the segment S electrodes 
(right) for an event in the segment data set. The energy of 1758 keV seen in the 
core is completely contained in segment S. The starting time is chosen arbitrarily in 
this example. The amplitude is in arbitrary units but the scale is the same for both 
pulse shapes. The pulse shapes are dominated by different charge carrier types. 
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2.2 Event selection 



A pre-selection applied to the segment data set collects events with energy- 
deposited only in one segment. It requires the energy measured in segment S 
to be the same as the energy measured in the core within ±5 keV, according 
to about ±4 a given the energy resolution. In total, 150 396 events fulfill the 
pre-selection criterion. 

Four data samples each are selected from the core and segment data sets. The 
data samples are defined by the energy measured in the core and are labeled: 

• DEP: The sample contains events with a core energy in the region of 
(1593 ± 5) keV. These events are associated with the double escape peak 
of the 2615 keV 208 T1 photon. The photon produces electron-positron pairs 
of which the positron subsequently annihilates. Both 511 keV annihilation 
photons escape the detector. The energy is predominantly deposited on a 
millimeter-scale; i.e., locally. 

• T\: The sample contains events with a core energy in the region of 
(1620 ±5) keV. These events are associated with photons of this energy 
produced in the decay of 212 Bi. The photons mostly scatter multiple times 
before their energy is fully deposited inside the detector. 

• T 2 : The sample contains events with a core energy in the region of 
(2615 ± 5) keV. These events are associated with photons of this energy 
produced in the decay of 208 T1. The photons mostly scatter multiple times 
before their energy is fully deposited inside the detector. 

• ROE. The sample contains events with a core energy in the region of interest, 
(2039 ±50) keV. These events are predominantly associated with Compton- 
scattered photons from 208 T1. 

The requirements of the trigger, pre-selection and event selection are listed 
in Table [TJ Also the number of events in the corresponding data samples 
are shown. The amount of background in each data sample, as estimated from 
taking spectra without the 228 Th source present, was found to be less than 1%. 



3 Monte Carlo simulation 

The GEANT4 [12] based MaGe [13] framework was used to simulate the pro- 
totype detector setup (for details and a validation of this particular simulation 
see [10]). A Monte Carlo study was performed to estimate the spatial distribu- 
tion over which energy is deposited in the detector for events in the different 
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Table 1 

Requirements of the trigger, pre-selection and event selection, and the number of 
events in the corresponding data samples. Eq and E$ are the energies seen in the 
core and in segment S, respectively. 



Cut 


Condition 




Events 


Trigger (TR C ) 


E c > 1 MeV 




127000 


Pre-selection 






127000 


Selection (DEP) 


\E C ~ 1593 keV| < 


5 keV 


1673 


Selection (Ti) 


\E C - 1620 keV| < 


5 keV 


1965 


Selection ^2) 


\E C - 2615 keV| < 


5 keV 


22 924 


Selection (ROI) 


\E C - 2039 keV| < 50 keV 


6431 


Trigger (TR S ) 


E s > 1 MeV 




420 000 


Pre-selection 


\E C - E s \ < 5 keV 




150 396 


Selection (DEP) 


\E C ~ 1593 keV| < 


5 keV 


3492 


Selection (ri) 


\Ec ~ 1620 keV| < 


5 keV 


1972 


Selection ^2) 


\E C - 2615 keV| < 


5 keV 


19 243 


Selection (ROI) 


\E C - 2039 keV| < 50 keV 


7707 



data samples. A 228 Th source was simulated. The trigger, pre-selection and 
event selection requirements discussed in the previous section were applied 
to the Monte Carlo data. The data sets are referred to as core and segment 
Monte Carlo data sets. 

A measure for the spatial distribution over which energy is distributed in- 
side the detector is the radius Rgo- This is defined as the radius inside which 
90% of the energy in a single event is deposited; for a detailed discussion 
see [7]. Figure [2] shows the distribution of -R 90 for the DEP, Ti, T 2 and ROI 
samples for the core (left) and segment (right) Monte Carlo data sets. All dis- 
tributions are normalized to unity. The Rgo distributions range from 0.1 mm 
(log 10 (i?9o) = —1) up to 7 cm (log 10 (-Rgo) — 1-8). The DEP samples are domi- 
nated by events with Rgo in a region from 0.1 mm to 1 mm. A long tail towards 
larger radii is visible and mostly due to events in the underlying Compton- 
shoulder of 208 T1 and events in which electrons undergo hard bremsstrahlung 
processes. The Rg distributions for the l~i and ROI samples have two promi- 
nent regions each, one at radii from 0.3 mm to 1 mm and a second from 
3 mm to 6 cm. The latter one is due to multiply scattered photons whereas 
the former is due to photons with higher energy which only scatter once and 
then leave the detector. The Rg distributions for the T 2 samples range from 
0.3 mm to about 7 cm with a maximum at around 2 cm for the core Monte 
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Carlo data sample and at around 1 cm for the segment Monte Carlo data 
sample. The sample is dominated by events in which photons scatter multiple 
times. No peak at small Rqq is visible. 

It is expected that the single segment requirement rejects events with large 
values of Rqq. Indeed, the distributions of -R90 in the segment Monte Carlo data 
samples are suppressed in the region above 1 cm. The peaks between 0.1 mm 
and 1 mm in the DEP, Ti and ROI samples are more pronounced in this case. 




log i0 (R 90 [mm]) lo 9 10 ( R 9 o [mm]) 

Fig. 2. Normalized distributions of R$o for the DEP, T\, T2 and ROI samples for 
the Monte Carlo core (left) and segment data sets (right). Single-site events (SSE) 
and multi-site events (MSE) are defined by requiring Rqq < 2 mm and Rqo > 2 mm 
(dashed line) as discussed in the text. 

Single-site and multi-site events are defined by requiring R 9Q < R and -R90 > 
R, respectively, where R is a chosen parameter value. The distributions of 
-R90 for the DEP samples suggest R = 2 mm (log 10 (i?) = 0.3). Also, due to 
the sampling rate of 75 MHz and the average drift velocity of charge carriers 
(O(10 8 ) mm/s) energy deposits closer than about 2 mm cannot be resolved. 
The fractions of single-site events in the Monte Carlo data samples are thus 
defined and summarized in Table Also listed are the corresponding system- 
atic uncertainties of the fractions which are derived by varying the parameter 
R by ±1 mm. 

The Monte Carlo data samples are not purely composed of single-site or multi- 
site events. The DEP samples are dominated by single-site events, the Ti and 
T 2 have large fractions of multi-site events. Events in the DEP samples are 
referred to as electron-like while events in the Ti and T 2 samples are referred 
to as photon-like in the following. Note, that these two labels do not describe 
an intrinsic property of an event (such as the range of energy deposition), 
but they are used to emphasize the different probabilities of the event being 
single-site or multi-site. 
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Table 2 

Fractions of single-site events in the Monte Carlo data samples. The errors are 
derived by varying the parameter R by ±1 mm. 



Monte Carlo data samples 


DEP 


Ti 


r 2 


ROI 




(1593 keV) 


(1620 keV) 


(2615 keV) 


(2039 keV) 


Core samples 


(77.91^)% 


(30.5it°)% 


(12.21 


(52.4±?;|)% 


Segment samples 


(89.01^)% 


(55.0±|;2)% 


(30.011°;°)% 


(77.61^)% 



4 Analysis methods 



Three analysis methods were tested. Half of the DEP and 1~\ data samples 
were used to train the methods. The other half of the samples, together with 
the T2 and ROI samples, were used to test the analysis methods. The DEP 
and Ti samples were selected for training in order to avoid biases due to the 
difference in energy of events in the two samples. For the same reason the 
maximum of each pulse shape was normalized to unity for each event. 

The analyses were applied to the core and segment data samples in order to 
study the effect of pulse shape analysis before and after the application of a 
single segment requirement. In the former case, only the core pulse shape was 
used. In the latter case, the core pulse shape was used and, optionally, the 
segment S pulse shape in addition. 

4-1 Likelihood discriminant method 

Four quantities are calculated for each pulse shape. These quantities provided 
separation power in previous studies [8f9] . Interpolation algorithms were ap- 
plied to the pulse shapes to obtain continuous distributions. Figure [3] shows 
an ideal pulse and the quantities calculated are indicated. All quantities are 
given subscripts C and S for the core and segment pulse shapes, respectively. 

• Risetime Tio-3o> defined as the difference between the times the integrated 
charge amplitude has reached 10% and 30% of its maximal amplitude; 

• risetime 710-90, defined as the difference between the times the integrated 
charge amplitude has reached 10% and 90% of its maximal amplitude; 

• left-right asymmetry (, defined as the asymmetry of the area below the 
left and the right half of the current pulse, A\ and A r , measured from the 
maximum^, ( = 

1 The definition differs from the one given in |8|9j . 
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current pulse width 5, defined as the full width at half maximum of the 
current pulse. 




Fig. 3. Ideal pulse shape: the integrated charge (thick line) and the current (thin 
line). Indicated are the quantities T10-30, i"io-90) 5, A\ and A r (see text). 

The variables are histogrammed for both training samples and their integrals 
are normalized to unity. As an example, Figure H] shows the normalized distri- 
butions of the four quantities calculated from the core pulse shape in the two 
segment data samples. The average risetime of pulses in the DEP sample is 
larger than that of pulses in the Ti sample The relative frequencies are used 
to define discriminants, given that the event is electrondike (DEP sample) or 
photondike (T\ sample). The respective overall discriminants, p e - and p*y , are 
calculated by multiplying the individual discriminants: 



Pe- =p(TL0-3o,*|e ) -P(no-90,fc|e )-p(Ck\e ) ■ p(5 k \e ), (1) 

P*=p(no-3oM ■ p(no~9o,k\l) -p(Ck\l) -p(Skh) , (2) 

with k = C or S for the core and segment pulses, respectively. Note that no 
correlations among these quantities are taken into account. 

Likelihood discriminants (LHD) are constructed from p e - and p 7 for each 
individual event: 



DC = ^T^i (3) 

Pe- + Py 

C S 

TjC+S _ Pe- ■ Pe~ / 4 \ 

Pe- • Pe- + P7 • P7 

2 This behavior was also found in a simple calculation of pulse shapes assuming a 
perfect crystal and not taking into account any effects from the electronics. 
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where D c uses information from the core electrode only and D c+S uses in- 
formation from the core and segment electrodes. D varies between and 1 by 
construction. D peaks at 1 for electron-like events; for photon-like events D 
peaks at 0. Events are identified as electron-like for D > D and as photon-like 
for D < D, where D is a chosen parameter. 

90.12 
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Fig. 4. Quantities calculated from the core pulseshapes in the DEP (open his- 
togram) and Ti (hatched histogram) segment data samples. Top left: risetime T10-30, 
top right: risetime T10-90, bottom left: left-right asymmetry £, bottom right: current 
pulse width 5. 



4-2 Library method 



The training DEP samples are interpreted as libraries of electron-like refer- 
ence pulses. An average x 2 w hh respect to all reference pulses is calculated 
for each pulse shape in the test samples. For the kth reference pulse and the 
/th pulse shape under study the average x 2 is defined as 

JV i=i 
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where N is the number of bins of the pulse shapes and Xk,i and the 
pulse heights in bin i of the kth reference pulse and the Ith pulse under study. 
a 2 is defined as 



a 2 = a 2 + a 2 , (6) 



where o"k and o\ are the noise amplitudes of the reference pulse shape and 
the pulse shape under study. The noise amplitude is the RMS of the baseline 
measured during the one /xs before the onset of the pulse. 



The minimum x 2 is selected with respect to the reference pulses and denoted 
Xmin = Xk ■ i f° r eac h pulse shape in the test sample. Ideally, the minimum 
X 2 for electron-like events should be smaller than that of photon-like events. 
Events are identified as electron-like for x min < x 2 an d as photon-like for 
Xmin > X 2 ) where x 2 is a chosen parameter. 



4-3 Neural network method 



Artificial neural networks (ANNs) are used to separate electron-like from 
photon-like events. Input neurons are fed with samples of the normalized pulse 
shape, starting from the time when the amplitude has reached 10%. 40 consec- 
utive samples per pulse shape are used. The ANN consists of 40 input neurons, 
40 hidden neurons and one output neuron for the core data samples. An ad- 
ditional 40 input neurons are used optionally for the segment data samples. 



The ANNs are trained by feeding them with pulse shapes from the two training 
samples and simultaneously providing the information which of the samples 
each pulse belongs to (0: DEP sample, 1: I\ sample). The ANNs adjust the 
internal neurons iteratively using the Broyden, Fletcher, Goldfarb, Shanno 
(BFGS) learning method [TJ]. Each ANN is trained in about 1000 iterations. 
The output quantity, NN, is on average larger for photon-like events than for 
electron-like events. Events are identified as electron-like for NN < iViV and 
as photon-like for iViV > iVW, where iViV is a chosen parameter. 
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5 Results 



The three analysis methods are applied to the data samples defined in Sec- 
tion 12.21 The likelihood discriminant and neural network analysis are per- 
formed on the segment data samples (a) with information from the core elec- 
trode only and (b) with information from the core and the segment S electrode. 
As an example, Figure [5] shows the output distributions for the two segment 
training data samples DEP and T\ for the likelihood method (left), the library 
method (middle) and the neural network (right). The segment pulse shapes 
have not been taken into account for these examples. 




0.2 0.4 0.6 0.8 1 5 10 15 20 25 30 -0.2 0.2 0.4 0.6 0.8 1 1.2 



Fig. 5. Output distributions for the two segment training data samples DEP (open 
histograms) and T\ (hatched histograms) for the likelihood method (left), the library 
method (middle) and the neural network (right). The segment pulse shapes were 
not taken into account in these examples. 

The results of the analysis are interpreted in the following. First, it is shown 
that the electron-like and photon-like event samples can be distinguished. In 
a second step, the results are interpreted to distinguish between single-site 
and multi-site events. The estimate of the power of such a distinction requires 
the knowledge of the fraction of single-site and multi-site events in the data 
samples. That information is taken from the Monte Carlo simulation presented 
in Section [3] based on the parameter R 90 . 

5.1 Distinction between electron-like and photon-like event samples 

The power to distinguish between electron-like and photon-like event samples 
is estimated. The events in the DEP sample are assumed to give the same 
output in the analyses as events from neutrinoless double beta-decay. The cut 
values are chosen to keep 90% of the events in the DEP training samples for 
the three analysis methods and thus a high detection efficiency The fraction 
of events in each test data sample identified as electron-like are summarized in 
Table [31 The uncertainties are estimated from the deviation from 90% of the 
fraction of events identified as electron-like in the DEP test data samples and 
found to be about 2%. Note that no deviation is found in case of the library 
method since the DEP training data sample is used as a reference library. 
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Table 3 

Fraction of events in the test data samples identified as electron-like for the three 
analyses. The uncertainties are estimated to be about 2%. 



Data samples 


DEP 


Ti 


r 2 


ROI 




(1593 keV) 


(1620 keV) 


(2615 keV) 


(2039 keV) 


Likelihood method 


Core samples 


89.3% 


76.5% 


75.4% 


83.4% 


Segm. samples, core only 


89.3% 


67.1% 


64.1% 


84.8% 


Segm. samples, core & segm. 


88.0% 


66.7% 


61.1% 


83.4% 


Library method 


Core samples 


90.0% 


86.9% 


85.8% 


86.7% 


Segm. samples, core only 


90.0% 


68.4% 


56.4% 


83.1% 


Neural network method 


Core samples 


90.4% 


65.8% 


63.2% 


79.9% 


Segm. samples, core only 


89.3% 


54.1% 


44.3% 


80.8% 


Segm. samples, core & segm. 


89.3% 


56.1% 


49.9% 


79.6% 



The fraction of events identified as electron-like is significantly lower than 90% 
in the T\, T 2 and ROI samples. The fraction in the I\ sample is found to be 
larger than that in the T 2 sample with each method. This is expected, as the 
mean free path of photons increases with the photon energy. 



The fraction of events identified as electron-like in the l~i and T 2 segment data 
samples (using the core pulse shape only) is found to be lower than that in 
the core data samples with all three methods. The additional usage of the 
segment pulse shape in the analyses reduces the fraction by maximally 3%; 
in case of the neural network it even increases the fraction by maximally 5%. 
This demonstrates that the additional information is highly correlated with 
the existing information and only marginally contributes to the analysis. 



The neural network shows the best performance. This is expected, since the 
ANN uses the largest fraction of information and also takes correlations be- 
tween input variables into account. 
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5.2 Selection of single- site events and discrimination against multi-site events 



As demonstrated in Table [2j neither the DEP nor the T\, T2 and ROI sam- 
ples are solely composed of single-site or multi-site events. The probability 
to correctly identify single-site and multi-site events as such, e and r/, can be 
deduced from the fraction of single-site and multi-site events in each sample 
(estimated from Monte Carlo) and the output of the analyses, D, x m in> NN: 



~ N %j N TsE " M %J M i 

_ N% SE /N t fJ - M^ E /M t 



where N^ d and N£[ are the number of events in the DEP sample iden- 
tified as single-site and multi-site events, respectively. The numbers depend 
on the cut value chosen for each analysis. N^ E and N^f e are the true num- 
ber of single-site and multi-site events in the same sample and are estimated 
from the Monte Carlo simulation discussed in Section [3j Mf d and are 
the number of events in the Ti sample identified as single-site and multi-site 
events, respectively. M t f^f and M^ E are the true number of single-site and 
multi-site events in the same sample. The probabilities e and rj are assumed 
to be the same for all samples. This assumption is reasonable for the DEP 
and Ti samples as the average energies are very close. 



The cut values for the three analysis methods are chosen to maximize the 
figure of merit, the identification efficiency ^Je ■ n. Note, that these cut values 
differ from those used in Section [57T1 The probabilities obtained from the data 
samples using Equations [7] and M are listed in Table HI 

The likelihood and library methods work better on events with only one seg- 
ment hit. The additional usage of the segment pulse shape in the likelihood 
method does not improve the analysis results. 



The analysis of the neural network output yields probabilities larger than 
one for the segment data samples. The calculation of e and rj depends on 
the real fraction of single-site and multi-site events and is therefore model 
dependent. The current model assumes the fraction of single-site and multi- 
site events to be completely reflected by the parameter R 90 . The validity of 
the assumed model is limited and the extraction of the probabilities e and rj 
carries systematic uncertainties. The results should be taken with care. The 
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Table 4 

Probabilities e and ry obtained for all three analysis methods. The errors are in- 
troduced by the choice of R determining the fraction of single-site and multi-site 
events. 



Analysis 




€ 


V 




V 


e • rj 




Likelihood method 


Core samples 




(74.81 


0.3) /o 


( 84.71 




(79.6 


+1.4\ 
-0.2) 


% 


Segm. samples, core 


only 


(84.31 




( 97.71 


W 5 .l)% 


(90.8 


-1.9J 


% 


Segm. samples, core 


& segm. 


(83.91 




( 94.01 




(88.8 


+4.6n 
-1.8.1 


% 


Library method 


Core samples 




(68.71 


83)% 


(56.ll 




(62.1 


+0.7\ 
-Q.2J 


% 


Segm. samples, core 


only 


(90.91 


13 °i)% 


(SO^l^i^o 


(85.6 




% 



Neural network method 



Core samples 




(85.61^)% 


( 91.0j 


- 4.3 W 

- 0.3) /o 


( 88.3±55)% 


Segm. samples, 


core only 


(96.4±g;|)% 


(121.61 


-15.0w 
- 8.5) /o 


(108.3±|;i)% 


Segm. samples, 


core & segm. 


(90.61^)% 


(115.41 


-13.4w 
- 7.7 ) /o 


(102.31^)% 



efficiencies do not exceed unity for the chosen cut parameter for the core data 
samples. Figure [6] shows e and rj together with the identification efficiency as 
a function of the neural network cut parameter for the core data samples. 




-0.4-0.2 0.2 0.4 0.6 0.8 1 1.2 1.4 

Cut 

Fig. 6. Probabilities to correctly identify single-site, e, and multi-site events, 77, 
and the efficiency, ^Je ■ 77, for the neural network analysis of the core data samples. 
Probabilities above one are caused by uncertainties in the extraction process. 
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5.3 Application to the 228 Th data set 



Figure [7] (left) shows the energy spectrum resulting from a 228 Th source in 
the region from 1.3 MeV to 2.7 MeV as seen by the core electrode. The black 
line corresponds to all events with only segment S hit, the gray line represents 
events with only segment S hit and pulse shape analysis, using the ANN, ap- 
plied. Only the pulse shape of the core was used and the cut parameter was 
chosen to keep 90% of the events in the DEP training data sample. 



The gray spectrum is suppressed with respect to the black spectrum. The 
suppression ranges up to a factor of about two at the photon peaks. The sup- 
pression is weak in the double escape peak. Figure [7] (right) shows a close-up 
of the spectrum in the region from 1560 keV to 1650 keV. The application 
of the pulse shape analysis removes photon induced events (1620 keV photon 
line from the decay of 212 Bi) but keeps most of the electron induced events 
(double escape peak of the 2 615 keV 208 T1 photon at 1593 keV). Pulse shape 
analysis is thus suitable to confirm the signal process. 
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Fig. 7. Spectrum of a 228 Th source as seen by the core electrode. The black line 
corresponds to all events with only segment S hit, the gray line represents events 
with only segment S hit and pulse shape analysis, using the ANN, applied. Only 
the pulse shape of the core was used and the cut parameter was chosen to keep 90% 
of the DEP events. Left: Spectrum from 1.3 MeV to 2.7 MeV. Right: Close-up of 
the region from 1560 keV to 1650 keV. For a discussion see text. 
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6 Conclusions and outlook 



Three methods using pulse shapes were introduced to distinguish electrons 
from multiply scattered photons. They were applied on data collected with 
a GERDA prototype detector. Single-site dominated samples were distin- 
guished from multi-site dominated samples. The probability to correctly iden- 
tify single-site and multi-site events was estimated based on Monte Carlo 
calculations. 

All three methods were trained with double escape events and events from 
a nearby photon peak. The former events are expected to be similar to the 
expected Oz//3/3-events. 

The methods are based on information from the core electrode and may in- 
clude information from the segment electrode or not. The power to identify 
photon induced events does not increase with the straightforward inclusion of 
the pulse shape of the segment. 

The performance of the three methods is slightly worse than what was re- 
ported in [9J. A reason for this is the purity of the samples. Also, the spatial 
distribution of energy deposited inside the detector is not homogeneous in 
the DEP sample. Methods to select cleaner and more homogeneous training 
samples are currently being tested. 

The artificial neural network shows a better performance than both the like- 
lihood discriminant and the library method. Photon peaks remaining after a 
single segment cut are suppressed by a factor of about two at energies around 
1.5 MeV. At the same time 90% of the events in the single-site dominated 
sample are kept. This demonstrates that the association of a particular peak 
with the signal process can be substantiated by this kind of analysis. 

The calculation of the efficiency to correctly identify single-site and multi-site 
events is limited by the assumed model based on the Rgo parameter. Fur- 
ther studies are required; in particular, a simulation of the development of 
pulse shapes is important and is currently under development. Studies using 
additional information from neighboring segments to distinguish single-site 
from multi-site events are also planned. In addition, an improved experimen- 
tal setup is planned. 
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The rejection of events in the 1620 keV peak using segment anti-coincidences 
as presented in [10] is about a factor of two better than the sole application of 
pulse shape analysis as presented in this paper. Nevertheless, the application 
of pulse shape analysis after a single segment cut can further reject events in 
this peak by an additional factor of about two. 
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